CDS

Accession Number TCMCG042C65672
gbkey CDS
Protein Id XP_016497808.1
Location complement(join(28026..28228,29909..30052,30350..30465,31220..31338,31572..31696,31800..31886,32335..32485,32573..32749,33623..33670,34597..34881))
Gene LOC107816596
GeneID 107816596
Organism Nicotiana tabacum

Protein

Length 484aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA319578
db_source XM_016642322.1
Definition PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950-like [Nicotiana tabacum]

EGGNOG-MAPPER Annotation

COG_category M
Description procollagen-lysine 5-dioxygenase activity
KEGG_TC -
KEGG_Module -
KEGG_Reaction R07376        [VIEW IN KEGG]
KEGG_rclass RC00017        [VIEW IN KEGG]
RC02950        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K08730        [VIEW IN KEGG]
EC 2.7.8.29        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00564        [VIEW IN KEGG]
ko01100        [VIEW IN KEGG]
ko01110        [VIEW IN KEGG]
map00564        [VIEW IN KEGG]
map01100        [VIEW IN KEGG]
map01110        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGATCAGAGAAGGGAGGCACAAGCTAACGAGATTAATGGGAGTCAGAGCAATGAAAACGACGGCGTTTCATCGAAGGGGCCAGCGCTGCGGCTGTACCCATGTGTAGAGCAGAAGGCGGAGAACTATGAGGATTTAGAAGAAGAATTGGAGTTCAGCCCACACCTATATAGTGCTCTTGAGCGGCATCTTCCGAGCAGCGTACTCAGTTCATCTCGAGACAACAAGGTCCAATACATGACTGATATTCTCCTCCGTTACTCTCCCCGCGCCGACCGCAGTCGCTTGCAGAAACATGGAGAATACAGGCAGAAAATCATATCAAACTATCAGCCTTTACACAGGGTGTTATATACCATGCACGCCGCAGATTTCTTTGTACCGTCGTTTATTAAGGCGATCAGTGAGAATAAGGAGGAAAGCTTCAGAAAAATAATGTCTGAACCTTCTCCAGGTGTTTTTACATTTGAAATGCTTCAACCGCGTTTTTGTGAGATGATGTTGGCTGAGGTACAAAACTTTGAGAAGTGGGTTCGTGAAACAAAATTCAGAATCATGCGTCCCAATACTATGAACAAATTTGGAGTTGTTCTTGATGACTTTGGCCTTCAAAACATGCTTAAGAAGTTTATGGAAGATTTTATACGCCCTATTTCAACAGTTTTTTTTACTGAAGTTGGTGGATCCACACTCGATGGTCATCATGGTTTTGTCGTTGAGTATGGAACAGACAGAGACATTGACTTGGGTTTCCATGTTGATGATGCGGAGGTCACTTTGAATGTGTGCTTAGGAAAGCAATTCACAGGTGGAGAGTTGTTCTTCCGGGGCGTACGGTGCGAGAAGCATGTGAACTCTGATACACAGCCAGATGAGATCTTTGATTATGTGCATGTTGCGGGGCGTGCGATTCTACATTGTGGTCGCCATAGGCATGGTGCTAGAGCGACAACATATGGGCAGAGGATCAACTTGTTGATATGGTGCAGAAGCTCTGTTTTCAGAGAGACGAGGAAGTACCAAACAGATTTTCCAAGCTGGTGTGCAGAGTGCAAGCGTGAGAAGGAAGAAAGGATACGGCAAAAAGCTTCTATTCTCAAATCGGTAGGAGTTGCGCCTGAGAACTGTATAGCTATTTTTCTTATCCATTTTCAAGAGTTGCTGAGAAGAGTTGGAGAATGTGAAACTTTACGGCGTAAATCCGAATTTGATATGAGTTTTAATATGGGTATCGCAAGAAAATCGCGAAAAGGCATCCTCAACAAGCCCGCCCGGAACACGGCTACAGAAAATGCAAAGACTTGCCCTATACAACATGCAAAAGAGCAGTGCAACCAACTCACTCCTCCAACGACAGGTATTACTCACAATGTCATTGATAGTGCAGGTGACAACGCCCTCGCAGCCATTCTGAAAAGGATGGAGGAAATGGAGAACAAGAACAAGGCACTCTGA
Protein:  
MDQRREAQANEINGSQSNENDGVSSKGPALRLYPCVEQKAENYEDLEEELEFSPHLYSALERHLPSSVLSSSRDNKVQYMTDILLRYSPRADRSRLQKHGEYRQKIISNYQPLHRVLYTMHAADFFVPSFIKAISENKEESFRKIMSEPSPGVFTFEMLQPRFCEMMLAEVQNFEKWVRETKFRIMRPNTMNKFGVVLDDFGLQNMLKKFMEDFIRPISTVFFTEVGGSTLDGHHGFVVEYGTDRDIDLGFHVDDAEVTLNVCLGKQFTGGELFFRGVRCEKHVNSDTQPDEIFDYVHVAGRAILHCGRHRHGARATTYGQRINLLIWCRSSVFRETRKYQTDFPSWCAECKREKEERIRQKASILKSVGVAPENCIAIFLIHFQELLRRVGECETLRRKSEFDMSFNMGIARKSRKGILNKPARNTATENAKTCPIQHAKEQCNQLTPPTTGITHNVIDSAGDNALAAILKRMEEMENKNKAL